Using Topology Preservation Measures for Multidimensional Intelligent Data Analysis in the Reduced Feature Space
نویسندگان
چکیده
This paper investigates a possibility of supplementing standard dimensionality reduction procedures, used in the process of knowledge extraction from multidimensional datasets, with topology preservation measures. This approach is based on an observation that not all elements of an initial dataset are equally preserved in its low-dimensional embedding space representation. The contribution first overviews existing topology preservation measures, then their inclusion in the classical methods of exploratory data analysis is being discussed. Finally, some illustrative examples of presented approach in the tasks of cluster analysis and classification are being given.
منابع مشابه
A Geometry Preserving Kernel over Riemannian Manifolds
Abstract- Kernel trick and projection to tangent spaces are two choices for linearizing the data points lying on Riemannian manifolds. These approaches are used to provide the prerequisites for applying standard machine learning methods on Riemannian manifolds. Classical kernels implicitly project data to high dimensional feature space without considering the intrinsic geometry of data points. ...
متن کاملTopology Preservation Measures in the Visualization of Manifold-Type Multidimensional Data
Most of real-life data are not often truly high-dimensional. The data points just lie on a low-dimensional manifold embedded in a high-dimensional space. Nonlinear manifold learning methods automatically discover the low-dimensional nonlinear manifold in a high-dimensional data space and then embed the data points into a low-dimensional embedding space, preserving the underlying structure in th...
متن کاملSelecting effective features from Phonocardiography by Genetic Algorithm based on Pearson`s Coefficients Correlation
The heart is one of the most important organs in the body, which is responsible for pumping blood into the valvular systems. Beside, heart valve disorders are one of the leading causes of death in the world. These disorders are complications in the heart valves that cause the valves to deform or damage, and as a result, the sounds caused by their opening and closing compared to a healthy heart....
متن کاملSupervised Feature Extraction of Face Images for Improvement of Recognition Accuracy
Dimensionality reduction methods transform or select a low dimensional feature space to efficiently represent the original high dimensional feature space of data. Feature reduction techniques are an important step in many pattern recognition problems in different fields especially in analyzing of high dimensional data. Hyperspectral images are acquired by remote sensors and human face images ar...
متن کاملOnline Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features
Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...
متن کامل